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Normally the issue or question of the time of arrival of light rays at an observer coming from a 
given source is associated with Fermat's Principle of Least Time which yields paths of extremal time. 
We here investigate a related but different problem. We consider an observer receiving light from 
an extended source that has propagated in an arbitrary gravitational field. It is assumed from the 
start that the propagation is along null geodesies. Each point of the extended source is sending out 
a light-cones worth of null rays and the question arises which null rays from the source arrive first 
at the observer. Stated in an a different fashion, a pulse of light comes from the source with a wave- 
front as the leading edge, which rays are associated with that leading edge. In vacuum flat-space we 
have, from Huygen's principle, that the rays normal to the source constitute the leading edge and 
hence arrive first at an observer. We here investigate this issue in the presence of a gravitational 
field. Though it is not obvious, since the rays bend and are focused by the gravitational field and 
could even cross, in fact it is the normal rays that arrive earliest. We give two proofs both involving 
the extemization of the time of arrival, one based on an idea of Schrodinger for the derivation of 
' gravitational frequency shifts and the other based on V.I. Arnold's theory of generating families. 

. Dedicated to Jayant Narlikar. 
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I. INTRODUCTION 

> 

— ■ If one considers the problem of the "time of arrival" of a light signal coming from an extended source (a two- 
\& , surface) and arriving at an observer who is moving along an arbitary time-like world-line, one normally invokes 
Fermat's Principle of Least Time to determine the path of the light-ray from each point on the source to the observer. 
We will discuss a different but related issue - at each source point there is a light-cones worth of rays that are emitted 
and the question is what are the directions of the rays for them to have the earliest "time of arrival" at the observer. 
Stated in a different manner, if the source suddenly lights-up, then from each point there will be a sphere's worth of 
rays leaving it resulting in a pulse arriving at the observer from different source points with a leading edge followed by 
a tail. The question then is which rays arrive earliest? From the start we assume that the paths of the rays are given 
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by null geodesies, so that there is no need to invoke Fermat's Principle. In flat vacuum space-time it is clear that it 
■ is the normal rays that arrive earliest, i.e., they travel the shortest distance. However in a Lorentzian manifold with 
rr| a time varying metric the issue is not as obvious. The problems are that there is no well defined distance between 
source and observer, the rays can bend and be focused by the field and could even cross or even more than one ray 
KJ{ ' could arrive at the observer from the same source point. Nevertheless we will show that it is the rays that are emitted 
normal to the source that are the ones that arrive earliest, i.e., the time of arrival function is extremized by the normal 
& ' rays. 

We will give two alternative proofs of this. The first proof, given in Sec. II, involves a modification of the beautiful 
derivation by Schrodinger of the gravitational frequency shift while the second proof, in Sec. Ill, uses the theory of 
generating families of V.I. Arnold. 

Probably the most powerful technique for the study of wavefronts and their related characteristic (or null) surfaces 
in Lorentzian space-times and the associated difficulties in the analytic description of the development of caustics 
and crossover regions is Arnold's theory of Lagrangian and Legendre submanifolds and the associated Lagrange and 
Legendre maps PJ . One of the main ingredients in this theory is the construction of what has been referred to as 
generating families. They are, in general, two-point functions, F(x,s) (chosen from, perhaps, different spaces, x and s, 
with perhaps different dimensions), that are to be constructed from physical arguments and which are stationary with 
respect to variations in one of the two different spaces. In our case we give an example of this construction where the 
two-point function is the time-of-arrival function of light rays which begin from points on a two-surface, embedded in 
a four dimensional space-time, (thought of as a source of radiation), and which end at points on a one-dimensional 
manifold, a curve (thought of as the world-line of an observer of that radiation) also embedded in the same four-space. 
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Though we will not amplify on it here, this example appears to be, in principle if not in practice, of generic use in 
the theory of gravitational lensing in any Lorentzian space-time. 

II. THE TIME OF ARRIVAL FUNCTION 

We are concerned with the travel time of light signals from an extended source to a localized observer. For our 
purposes, the source lights up instantaneously, in its own rest frame, emitting photons in all directions from every 
point on its (closed) surface. There is one photon that arrives first at the observer's location, in the observer's 
proper time. If the metric is stationary, then this photon is, intuitively, the one that takes the shortest spatial path, 
perpendicularly to the surface of the source. In the following, we make these notions more precise, extending them 
to the case of arbitrary metrics. 

Consider, in an arbitrary Lorentzian four-dimensional manifold, a given closed space-like two-surface, S, described 

by 

x a = xZ(s\s 2 ) (1) 

where x a are space-time coordinates in the neighborhood of the source, and s J = (s^s 2 ) parametrize the surface 
S. In addition, consider a timclike worldlinc, C. In the neighborhood of the worldline, with no loss of generality, let 
the local coordinates be such that C is given by (t, X 1 ) where the X % are three constants, the spatial location of the 
observer, and r is the proper time along the worldline. 

From each point s' 7 of 5, construct its future lightcone, C s j . In general, in the absence of horizons, the line C 
intersects each C s j at least once. The intersection takes place at a particular value of the proper time r for each point 
s J on the surface. This means that there is a two-point function 

t = T{X\s j ). (2) 

that represents the proper time of arrival at C of light signals from S. One explicit way of constructing such a function 
is as follows. The lightcone C s j is foliated by light-rays from the point Xq(s j ), which are solutions 

z a =7 a (r; S J ,M) (3) 

of the geodesic equation with initial data labeled by the initial point s J and the initial direction (9, 4>) of the ray. 
Here r can be thought of as an affine parameter along the null geodesies. The intersection of the lightcone with the 
worldline C takes place at points where 

1 i (r;s J ,9,cf>)=X i (4) 
and the time (x a = r) at which the lightray reaches the observer is 

T = 7 °(r; S J ,M) (5) 

where the values of (r; s J , 9, </>) in the right-hand side are restricted by (f|). In cases where (||) is invertible for every 
value of s J , it provides (r, 9, (f) as functions of (X 1 , s' 7 ), which can be inserted into (|5|) to yield (||). 
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FIG. 1. Origin of the multiplicity of the time of arrival function. Due to the folding of the lightcone in the presence of 
curvature, there are several null geodesies that reach the same spatial location, at different times. In our picture, three null 
geodesies, leaving xq in different directions at the same time, reach the same worldline. 



For large distances between the source and observer, a worldline intersects any generic future lightcone several times, 
due to the folds in the individual light-cones produced by space-time curvature (see Fig. |l|). This is the case where 
(Q) is not invertible, since for every fixed value of s J there would be several values of the set (r, 0, 4>) corresponding to 
the same spatial location X % . This means that there are several photons, shot in different directions, that reach the 
observer's location at different times. Therefore, for large distances the function T(X' 1 , s J ) is multivalued. We restrict 
attention to such cases where T(X l , s J ) is single valued. In this case, there exists the following theorem, mentioned 
in the special case of static space-times by Arnol'd (see jjj, p. 251, and jjj, p. 298): 

Theorem. The proper time of arrival, T, at C, is extremized by those rays that leave S perpendicularly to it. In other 
words, the points s J such that 



(0) 



are connected to C by light-rays that are normal to S. 

Proof: The proof is based on the standard variational principle for null geodesies, and is an extension of a similar 
result in H . Consider the action 



I(p, q) = / g ab x a x h dr, 
Jo 



(7) 



where x a = dx a /dr is the tangent vector to an affinely parametrized null geodesic between the points p = x a (0) and 
q = x°(l), and s is the affine parameter. Consider the variation of / constructed by taking the difference between 
two neighboring null geodesies with different initial points, p\ and P2, and different final points, q\ and <?2 Since I 
evaluates identically to zero in both instances, its variation is zero as well, 



= AI = I(p 2 ,q 2 )-I(pi,qi) 



(8) 



The variation is 



AI 



9ab, c x a x b Sx c + 2g ab x a Sx D dr 



({g ab , c - 2g cbta )x a x b - 2g cb x b ) Sx c + 2— (g ab x a Sx b ) dr. 



(9) 
(10) 



Since the curves are null geodesies, the term proportional to Sx c in the integrand vanishes, and we are left with 
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AI = 2 (g ab x a Sx%. (11) 

By (|) and ([□]), we have 

9abi a Sx b \ r=1 = gab± a Sx b \ r=Q (12) 



In (12), 5x b represents an arbitrary (up to the condition that p and q can be connected by a null geodesic) displacement 
at r — and r — 1 between the null geodesic with tangent x b and a neighboring one. We now particularize ( |l2| ) to 
our case of interest, in which, initially, neighboring null geodesies are connected by displacements on the surface S; 
i.e.; 

Sx b \ r=0 = (M) ds J (13) 

dx b . T 

where -^p- are the two coordinate tangent vectors to S and ds is arbitrary. The final displacement must be tangent 
to £, i.e., 

5x b \ r=1 =v b dT (14) 

with v b the tangent vector to the curve C. However, because the two null geodesies arriving at r = 1 and separated 
by Sx b \ s= i must be the same pair of null geodesies leaving r — separated by 8x b \ r= o then dr is not arbitrary, but 



BT 

dr = dT\ xi = gpds J . (15) 



With @, (0) and @, Eq. @ reads 



l ab ± a ^ - g ab x a v b ^-j) ds J = 0. (16) 

Since ds J is arbitrary, and since g a bi a v b can not vanish as long as x a and v b are tangent to a null and a timelike 
curve, respectively, then (|l^) is equivalent to 



9s J \g c di c v d J ds J 

This implies that, at each point of <S, dT/ds J vanishes if and only if the null ray x a is normal to the surface at that 
point. This proves the theorem. □ 

Note that since no property of the line C was used, the theorem can be restated as follows. Given a time foliation 
of a Lorentzian manifold with local coordinates chosen as (r, X 1 ), and a source, a closed two-surface that "lights up" , 
the time r = T(x l , s J ) of arrival at any spatial point X 1 , of light signals from a surface point s J , is extremized by the 
light-rays leaving the surface perpendicularly. 



III. EIKONALS AND THE TIME OF ARRIVAL 



In the following, we provide an alternative method for obtaining the time of arrival function which is based entirely 
on the use of the eikonal equation - with Arnold's generating families - and specifically on knowledge of a two-parameter 
family of solutions of the eikonal equation, 

g ab (x c )d a Zd b Z = 0; (18) 

i.e., it is assumed that a solution, with the two parameters a A = (a 1 , a 2 ) 

u = Z(x a ,a A ) (19) 

to Eq.© is known. Then for each value oia A the level surfaces of Z are null (i.e., d a Z is a null covector). Furthermore 
it is assumed that at each point x c , d a Z sweeps out the entire null cone at x a as a A goes through its range. 
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Remark. We point out and emphasize that the level surfaces of the solutions to Eq.(|18|) though referred to as "null 
or characteristic surfaces" are not strictly speaking surfaces; they can have self-intersections and in general are only 
piece- wise smooth. Though Arnold refers to them as "big- wave- fronts" we will continue to call them null surfaces. 
The intersection of a big wave front with a generic three surface yields a two-dimensional (small) wave front. 

The first thing that we want to show is that the light-cone, £ Xo , from an arbitrary space-time point xg can be 
constructed from knowledge of the function Z of Eq.(|l9|). 

One sees immediately, from Eqs.(|l8|) and (|l9|), that the function 



S*(x a , xg, a A ) = Z(x a ,a A ) - Z(a%, a A ) = (20) 

defines a two-parameter set of surfaces which all pass thru the point xg and which, furthermore, arc all null surfaces. 
The envelope of this family is constructed by demanding that 

d A S*[x a ,x%,a A 





where d A denote the derivatives with respect to the a A . Assuming for the moment that (0 



a = a (x°), then when they are substituted into ( |2C| ) one obtains the function 

S(x a ,x%) = Z(x a ,a A (x a )) - Z{x%,a A {x a )) = 0. 



(21) 

could be solved for the 
(22) 

Using ( plf ) it is easy to see that d a S — d a S* so that again S(x a , xg) is a null surface thru the point xg; its gradient 
at xg, namely d a S = Z(xq, a A ), spans the light-cone at xg [at xg, Eq.(^lj) can not be solved for the a A = a A (x a ); all 
values of a A are allowed.] We thus see that Eq.(p2j) represents the light-cone <t Xo . The assumption that Eq.(pl|) could 
be solved for a A = a A (x a ) depended on the non- vanishing of the determinant J of the matrix = didjS*(x a , xg, a 1 ). 
J does vanish at the singularities of the "surface" S(x a , xg), e.g., at the apex x a = xg. In general, however even when 
J = 0, Eqs.(|2l]) and (pO) can be solved for other variables, namely some set of three (say x a ;which might be different 



in different regions) of the four x a , in terms of the fourth one (say x*) and the a , i.e., 

x a =x a (x%,x*,a A ). (23) 

Note the important point that if the coordinates x a are such that three of them are space-like and one of them is a 
time coordinate, x°, then Eq.(|23|) has a stronger version, namely 



rv>3 rfj ( r r a -y* 

x° = x (xg,x*,a A ) 



(24) 
(25) 

where the two x J and the x* are the three space-like coordinates. That one can solve for the x° = x°(xg, x*, a A ) 
follows from the fact that Eq.(|22|) can always be solved, from the implicit function theorem, for x° since S* satisfies 
the eikonal equation and hence dS* /dx° ^ 

Eqs.(^4|) and ( p5|) are a parametric representation of € Xo via the null geodesies that rule it. For the different given 
values of the a A , they are the null geodesies thru xg. 

We thus have the result that the <t Xo ca,n be given either via the surface (|2|) or by its geodesies (|4|) and (|25|). We 
will return to Eq.(p5|) later. 
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FIG. 2. The construction of the wavefront as the envelope of the individual wavelets. 

If we now allow the Xq to lie on a space- like two-surface S described by Xq = Xq(s j ), parametrized by the two 
parameters s J , then the previous construction of light-cones yields the family of light-cones of all the points of 
S via Xq =4> Xq(s j ). The intersection of the set of all the light-cones with a constant-time slice x° = constant, is a 
family of individual (small, two-dimensional) wavefronts emanating from each point on the surface; they are denoted as 
"Huygen's wavelets" . By Huygen's principle, the envelope of all the wavelets, at x° = constant, is the two dimensional 
wavefront from the source S. (see Fig. ^ The evolution, as x° changes, of these wavefronts yields a new characteristic 
surface (big wave-front). It is equivalent to the envelope of the family of light-cones of all the points of S; the envelope 
corresponding to the stationary variation of the family of light-cones with respect to variations in the s J . 

More precisely, the envelope is the three-surface defined, first by Eqs.(pp|) and d|l]), [the conditions for the light-cones 
from Xq(s j )], i.e., 

S*(x a , x%{s J ), a A ) = Z(x a ,a A ) - Z(x%(s J ),a A ) = 0, (26) 



d A S*(x a ,x%(s J ),a A )=0 (27) 

augmented by the s J variations, i.e., by 

djS*(x a ,x a (s J ),a A )=0. (28) 

These are five conditions on the eight variables (x a ,s J ,a A ) thus forming a three-surface in the eight dimensional 

space; this when projected down to the space-time results in the aforementioned envelope. It is easily seen from 
Eqs.(g^) and (p8| ) that this surface, which we will denote by 

N{x a ) = (29) 



is a characteristic surface and hence satisfies the eikonal equation, (|18|). Though almost everywhere it can be given in 
the form of the vanishing of a function of x a , i.e., by Eq.( [29|), there will be lower dimensional regions where it must 
be given parametrically. See e.g., Eq.(|2|) or Eqs.(|2J) and (|25p. 

Before looking at the time of arrival function, we first look at Eq.(p8|) more closely. Substituting Eq.(|26|) into 
Eq.(p8|) and taking the required derivatives we have 

r>T a 

d a Z(x a (s J ),a A )^ = Q (30) 



which is the statement that for the null ray leaving S at the point Xq(s j ), d a Z(xQ(s J ),a A ) must be normal to the 
tangent vectors §frat S and thus normal to S. 1 
Xq(s j ), i.e., the rays parametrized by a A , just t 
thus have the result that ( Sol) can be solved by 



tangent vectors §frat S and thus normal to S. Eq.(|30|), hence, chooses among all the rays forming the light-cone at 
Xq(s j ), i.e., the rays parametrized by a A , just the appropriate a A so that the ray is the (unique) normal to S. We 



a A = a A (s J ). (31) 
Using Eq.(|3l|), we have that Eqs.(p6|) and ( p7| ) become 

S*{x a 1 x^s J ),a A (s J )) = Z(x a ,a A (s J )) ~ Z {x a {s J ) , a A {s J )) = 0, (32) 



d A S*(x a ,x a o ( S J ),a A )\ a A =aA{sJ) =0 



(33) 



Using the same argument that led to Eq. (|25|) , namely the implicit function theorem and dS*/dx° ^ 0, we see that 
Eq.(p2[) is equivalent to 



x° =T{x a ,x a (s J ),a A (s J )) 



(34) 



If we take the three x a = X a as the "constant spatial position" of the world-line of Sec. II, we have the time of 
arrival function. Since, interpreting Eq.(B3) as defining Eq. (p4|) implicitly, we have that 



dS* 



djT + djS* = 0, 



(35) 



G 



which, since dS*/dx° ^ 0, implies that 



djS* = => djT = 0. 



(36) 



Thus the extremization of S* implies the extremization of T(x a , Xq(s j ), a A (s J )) as was to be proved. This proof is 
not affected by the difficulties in Sec. 11 of the possible multivaluedness of the earlier T. 

In the terminology of Arnold, these results follow from his theory of Legendre submanifolds and maps, where 



We have given two derivations of a variational principle for the time of arrival of null geodesies at an observer. 
Superficially, it appears as if it were a version of Format's principle; in actuality it is quite different. Fermat's principle 
leads to local evolutionary laws for the rays while here we have from the start assumed that the rays are given by 
null geodesies. Our variational principle gives the initial direction of the ray. Often Fermat's principle is invoked to 
derive the equations of gravitational lensing |^,^| . A paper is now in preparation, using the techniques discussed here, 
in which a universal lensing equation valid in all situations is obtained. 
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from Eq. (|26]) , defines a generating family F(x a , a 




(37) 



IV. DISCUSSION 
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